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ABSTRACT 

We present detailed predictions for the properties of Lya-emitting galaxies in the frame- 
work of the ACDM cosmology, calculated using the semi-analytical galaxy formation model 
GALFORM. We explore a model which assumes a top-heavy IMF in starbursts, and which has 
previously been shown to explain the sub-mm number counts and the luminosity function of 
Lyman-break galaxies at high redshift. We show that this model, with the simple assump- 
tion that a fixed fraction of Lya photons escape from each galaxy, is remarkably success- 
ful at explaining the observed luminosity function of Lya emitters over the redshift range 
3 < z < 6.6. We also examine the distribution of Lya equivalent widths and the broad-band 
continuum magnitudes of emitters, which are in good agreement with the available observa- 
tions. We look more deeply into the nature of Lya emitters, presenting predictions for fun- 
damental properties such as the stellar mass and radius of the emitting galaxy and the mass 
of the host dark matter halo. The model predicts that the clustering of Lya emitters at high 
redshifts should be strongly biased relative to the dark matter, in agreement with observational 
estimates. We also present predictions for the luminosity function of Lya emitters at z > 7, a 
redshift range which is starting to be be probed by near-IR surveys and using new instruments 
such as DAzLE. 

Key words: galaxies:evolution - galaxies:formation - galaxies:high-redshift - galax- 
ies:luminosity function - cosmology:theory 



1 INTRODUCTION 

After an unpromising start, searches for Lya emission are 
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peting in observing efficiency with techniques such as broad-band 
searches for Lyman-break g alaxies. The next ge neration of near- 
infrared instrumentation (e.g. lHorton et al. 12004 will in principle 
allow Lya emitting galaxies to be found up to z ~ 20, permitting a 
probe of the star formation history of the Universe before the epoch 
when reionization is thought to have taken place. 

There are in fact a number of different mechanisms which can 
produce Lya emission from high redshift objects. (1) Gas in galax- 
ies which is photo-ionized by young stars will emit Lya as hy- 
drogen atoms recombine; thi s was originally proposed a s a signa- 
ture of primeval galaxies by [Partridge & Peebles I Il967l) . (2) Gas 
can alternatively be ionized by radiation from an active galactic 
nucleus (AGN). (3) Intergalactic gas clouds are predicted to emit 
Lya recombination radiation due to ionization of the gas by the in- 
tergalactic ultraviolet background (e.g. Hogan & Wevmann 1987; 
Cantalupo et 0/12005). (4) Gas within a dark matter halo which 
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is cooling and collapsing to form a galaxy may radiate much of 
the gravitational collapse energy by collisionally-excite d Lya emis - 
sion (e.g. IHaiman. Spaans & OuataertlEoOol; IFardal et al. Il200lh . 
(5) Finally, Lya can also be emitted from gas which has been 
shock heated by g alactic winds or by jets in radio galaxies (e.g. 
iMcCarthve? al. Il987l) . The majority of high-redshift Lya emitters 
(LAEs) detected so far are compact, and appear to be individual 
galaxies in which the Lya emission is powered by photoionization 
of gas by young stars. Lya surveys have also found another class 
of emitter, the so-called Lya blobs, in which the Lya emission is 
much more extended than individual galaxies, and may be powered 
partly by AGNs or gas coo ling ( Steid elef al. 11200(1 ; iBower et al. I 
2004; Mats uda et al. 1 12004'). We will be focusing in this paper on 
Lya emission powered by young stars, and so will not consider the 
Lya blobs further. 

To date, there has been relatively little theoretical work on try- 
ing to predict the properties of star-forming Lya-emitting galaxies 
within a realistic galaxy formation framework. Haim an & Spaansl 
119991) made predictions for the number of emitters based on the 
halo mass function and using ad-ho c assumption s linking Lya 
emission to halo mass, while iBarton et al\ 120041) made predic- 
tions for very high redshifts (z > 7) based on a gas-dynamical 
simulation. Furlanetto et al. I l2005h used gas-dynamical simula- 
tions to calculate Lya emission both from star-forming objects and 
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from the intergalactic medium in the redshift range < z < 5. 
However, the first calculation of the abundance of Lya emitters 
based on a d etailed hierarchical galaxy formation model was that of 
iLe Delliou et al. 1 120051 hereafter Paper I). In Paper I, we used the 
GALFORM semi-analytical galaxy formation model to predict the 
abundance of star-forming Lya emitters as a function of redshift 
in the cold dark matter (CDM) model. The GALFORM model com- 
putes the assembly of dark matter halos by mergers, and the growth 
of galaxies both by cooling of gas in halos and by galaxy mergers. 
It calculates the star formation history of each galaxy, including 
both quiescent star formation in galaxy disks and also bursts trig- 
gered by galaxy mergers, as well as the feedback effects of galactic 
winds driven by supernova explosions. In Paper I, we found that a 
very simple model, in which a fixed fraction of Lya photons escape 
from each galaxy, regardless of its other properties, gave a surpris- 
ingly good match to the total numbers of Lya emitters detected in 
different surveys over a range of redshifts. We also explored the 
impact of varying certain parameters in the model, such as the red- 
shift of reionization of the intergalactic medium, on the abundance 
of emitters. 

In this paper, we explore in more detail the fiducial model 
of Paper I (based on an Q m — 0.3, spatially flat, ACDM model 
with a reionization redshift of 10). We use the full capability of 
the GALFORM model to predict a wide range of galaxy properties, 
connecting various observables to Lya emission. The galaxy for- 
mation model we use is the same as that proposed by Baug h et al. I 
(2005). A critical assumption of this model is that stars formed in 
starbursts have a top-heavy initial mass function (IMF), while stars 
formed quiescently in galactic disks have a solar neighbourhood 
IMF. We showed in Baug h et al. I that, within the framework of 
ACDM, the top-heavy IMF is essential for matching the counts and 
redshifts of sub-millimetre galaxies and the luminosity function of 
Lyman break galaxies at z — 3 (once dust extinction is included), 
while remaining consistent with galaxy properties in the local uni- 
verse such as the optical and far-IR luminosity functions and galaxy 
gas fractions and metallicities. More detailed comparisons of this 
model with observations of Lyman-break galaxies and of galaxy 
evolution in the IR will be presented in Lacey et al. (2005a, 2005b, 
in preparation). The assumption of a top-heavy IMF is controver- 
sial, but underpins the success of the model in explaining the high- 
redshift sub-mm and Lyman-break galaxies. It is therefore impor- 
tant to test this model ag ainst as many observables as possible. 
Nagas hima et al. I l2005al) showed that a top-heavy IMF seems to 
be required to explain the metal content of the hot intracluster gas in 
galaxy clusters, and Nagashima et al. 1 2005b) showed that a sim- 
ilar top-heavy IMF also seems to be necessary to explain the ob- 
served abundances of a-elements (such as Mg) in the stellar pop- 
ulations of elliptical galaxi es. In the p resent paper, we explore the 
predictions of the Bau gh et al. I 120051 model for the properties of 
Lya-emitting galaxies and compare them with observational data. 
We emphasize that our aim here is to explore in detail a particular 
galaxy formation model which has been shown to satisfy a wide 
range of other observational constraints, rather than to conduct a 
survey of Lya predictions for different model parameters. 

In Section|2| we give an outline of the GALFORM model, fo- 
cusing on how the predictions we present later on are calculated. 
Section|3|examines the evolution of the Lya luminosity function, 
and compares the model predictions with observational data over 
the redshift range 3 < z < 7. In Section|4] we compare a selection 
of observed properties of Lya emitters with the model predictions. 
In Section|5] we look at some other predictions of the model, most 
of which cannot currently be compared directly with observations. 



Section [5] extends the predictions for the Lya luminosity function 
to z > 7. We present our conclusions in Section^. 



2 GALAXY FORMATION MODEL 

We use the semi-analytical model of galaxy formation, GALFORM, 
to predict the Lya emission and many other properties of galaxies 
as a function of redshift. The general methodology and approxima- 
tions behind the GALFORM model are set out in detail in lCole et al. I 
(2000). The particular model that we use in this paper is the same as 
that described by Bau gh et al. I J2005). The background cosmology 
is a cold dark matter universe with a cosmological constant (Q m = 
0.3, «a = 0.7, n b = 0.04, h = iio/lOOkms^Mpc- 1 = 0.7, 
erg = 0.93). Below we review the physics behind the particular 
model predictions that we highlight in this paper. 

The GALFORM model follows the main processes which shape 
the formation and evolution of galaxies. These include: (i) the 
collapse and merging of dark matter halos; (ii) the shock- heating 
and radiative cooling of gas inside dark halos, leading to forma- 
tion of galaxy disks; (iii) quiescent star formation in galaxy disks; 
(iv) feedback both from supernova explosions and from photo- 
ionization of the IGM; (v) chemical enrichment of the stars and 
gas; (vi) galaxy mergers driven by dynamical friction within com- 
mon dark matter halos, leading to formation of stellar spheroids, 
and also triggering bursts of star formation. The end product of the 
calculations is a prediction of the number of galaxies that reside 
within dark matter haloes of different masses. The model predicts 
the stellar and cold gas masses of the galaxies, along with their star 
formation and merger histories, and their sizes and metallicities. 

The prescriptions and parameters for the different processes 
which we use in this paper are identical to [Baugh et ql. | | 2005h . 
Feedback is treated in a similar way to Benson et al. I J2003h : en- 
ergy injection by supernovae reheats some of the gas in galaxies 
and returns it to the halo, but also ejects some gas from halos as 
a "superwind" - the latter is essential for reproducing the observed 
cutoff at the bright end of the present-day galaxy luminosity func- 
tion. We also include feedback from photo-ionization of the IGM: 
following reionization (i.e. for z < z le i on ), we assume that gas 
cooling in halos with circular velocities V c < GOkms^" 1 is com- 
pletely suppressed. We assume in this paper that reionization occurs 
at Zrcion = 10, chosen to be intermediate between the low value 
z ~ 6 suggested by measurements of the Gunn-Peterson trough in 
quasars ( B ecker et al. 1200 It) and the high value z ~ 20 suggested 
by the WMAP measurement of polarization of the microwave back- 
ground iKogutef a/.ll2003h . Our model has two different IMFs: 
quiescent star formation in galactic disks is assumed to produce 
stars with a solar neighbourhood IMF (we use the lKennicutt U 1 9 83l) 
paramerization, with slope x = 0.4 below IMq and x = 1.5 
above), whereas bursts of star formation triggered by galaxy merg- 
ers are assumed to form stars with a top-heavy, flat IMF with slope 
x — (where the Salpeter slope is x = 1.35). In either case, the 
IMF covers the mass range 0.15 < m < 12OA/0. As mentioned 
in the Introduction, the choice of a flat IMF in bursts is essential 
for the model to reproduce the observed counts of galaxies at sub- 
mm wavelengths. The parameters for star formation in disks and 
for triggering bursts and morphological transformations in galaxy 
mergers are given in lBaugh et al. I 120051) . 

The sizes of galaxies are computed as in ICole et al. I boOCh : 
gas which cools in a halo is assumed to conserve its angular mo- 
mentum as it collapses, forming a rotationally-supported galaxy 
disk; the radius of this disk is then calculated from its angular mo- 
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mentum, including the gravity of the disk, spheroid (if any) and 
dark halo. Galaxy spheroids are built up both from pre-existing 
stars in galaxy mergers, and from the stars formed in bursts trig- 
gered by these mergers; the radii of spheroids formed in mergers 
are computed using an energy conservation argument. In calculat- 
ing the sizes of disks and spheroids, we include the adiabatic con- 
traction of the dark halo due to the gravity of the baryonic compo- 
nents. 

Given the star formation and metal enrichment history of a 
galaxy, GALFORM computes the spectrum of the integrated stel- 
lar population using a population synthesis model based on the 
Padova stellar evolution tracks (see iGranato et al. 11200 0. for de- 
tails). Broad-band magnitudes are then computed by redshifting 
the galaxy spectrum and convolving it with the filter response func- 
tions. We include extinction of the stellar continuum by dust in the 
galaxy; this is computed based on a two-phase model of the dust 
distribution, in which stars are born inside giant molecular clouds 
and then leak out into a diffuse dust medium (see Granato et al. 
2000, for more details). The optical depth for dust extinction of 
the diffuse component is calculated from the mass and metallicity 
of the cold gas and the sizes of the disk and bulge. We note that 
the extinction predicted by our model in which the stars and dust 
are mixed together is very different from what one obtains if all 
of the dust is in a foreground screen (as is commonly assumed in 
other theoretical models). Finally, we also include the effects on 
the observed stellar continuum of absorption and scattering of ra- 
diation by intervening neutral hydrogen along the line of sight to 
the galaxy; we calculate this IGM attenuation using the formula of 
Madai^ Il995t) . which is based on the observed statistics of neutral 
hydrogen absorbers seen in quasar spectra. 

We compute the Lya luminosities of galaxies by the following 
procedure: (i) The model calculates the integrated stellar spectrum 
of the galaxy as described above, based on its star formation his- 
tory, and including the effects of the distribution of stellar metal- 
licities and of variations in the IMF. (ii) We compute the rate of 
production of Lyman continuum (Lyc) photons by integrating over 
the stellar spectrum, and assume that all of these ionizing photons 
are absorbed by neutral hydrogen within the galaxy. We assume 
photoionization equilibrium applies within each galaxy, producing 
Lya p hotons according to case B recombination (e.g. Osterbrock 
1989). We note that for solar metallicity, 1 1 times as many Lyc and 
Lya photons are produced per unit mass of stars formed for our top- 
heavy (burst) IMF as compared to our solar neighbourhood (disk) 
IMF. (iii) The observed Lya flux or luminosity of a galaxy depends 
on the fraction / osc of Lya photons which escape from the galaxy. 
Lya photons are resonantly scattered by neutral hy drogen, a nd ab- 
sorbed by dust. Early estimates of this process (e.g. lCharlot & Fall 
1 1 99 ll> showed that only a tiny fraction of Lya photons should es- 
cape from a static neutral galaxy ISM if even a tiny amount of 
dust is present. Many star-forming galaxies a re nonetheless ob- 
served to have sig nificant Lya luminosities (e.g. Kun th et al. 1 1998k 
Pettini et al. 1120011) . and this is generally ascribed to the presence 
of galactic winds in these systems, which allow Lya photons to es- 
cape after many fewer resonant scatterings. Radiative transfer cal- 
culations of Lya through winds have shown that this process can 
explain the asymmetric Lya line profiles which are typically ob- 
served (e.g. Ahn 2004). The effects of radiative transf er of Lya 
through clu mpy dust and gas h ave been considered by Neufeld 
<199ll) and lHansen & Oh1 120051) . 

Calculating Lya escape fractions from first principles is 
clearly very complicated, and so we instead adopt a simpler ap- 
proach. In Paper I, we found that assuming a fixed escape frac- 
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Figure 1. The predicted evolution with redshift of the cumulative Lya lu- 
minosity function, denned as the comoving number density of galaxies with 
Lya luminosities brighter than L^ ycl . The model predictions are shown for 
selected redshifts in the interval z = to z = 7. 

tion /osc for each galaxy, regardless of its dust properties, resulted 
in a surprisingly good agreement between the predicted number 
counts of emitters and the available observations. In that paper, we 
chose / csc = 0.02 to match the number counts at 2 ~ 3 at a flux 
/ ~ 2x 10~ 17 erg s _1 , and we use the same value of / csc in this pa- 
per. Although this extreme simplification of a constant escape frac- 
tion may seem implausible, it does give a reasonably good match 
to the observed Lya luminosity functions and equivalent widths at 
different redshifts, as we show in the next sections. 

Our calculations do not include any attenuation of the Lya 
flux from a galaxy by propagation through the IGM. Lya pho- 
tons can be scattered out of the line-of-sight by any neutral hy- 
drogen in the IGM close to the galaxy. If the emitting galaxy is 
at a redshift before reionization, when the IGM was still mostly 
neutral , this could in pr inciple strongly suppress the observed Lya 
flux jMiralda-Escude 1998). However, various effects can greatly 
reduce the amount of attenuation: ionization of the IGM around 
the galaxy iMadau & Rees 2000; Haiman 2002), clearing of the 
IGM by galactic winds, gravitational infall of the IGM towards the 
galaxy, and redshifting of the Lya emission by scattering in a wind 
I Santos 2004). In any case, since measurements of Gunn-Peterson 
absorption in quasars show that reionization must have occured at 
z > 6.5, attenuation of Lya fluxes by the IGM should not affect 
our predictions for z < 6.5, but only our predictions for very high 
redshifts given in Section|6| 



3 EVOLUTION OF THE LYa LUMINOSITY FUNCTION 

A basic prediction of our model is the evolution of the luminosity 
function of Lya emitters with redshift. This depends on the distri- 
bution of star formation rates in quiescent and starburst galaxies 
(with solar neighbourhood and top-heavy IMFs respectively), and 
on the metallicity with which the stars are formed. Paper I showed 
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Figure 2. The evolution of the cumulative Lya luminosity function with redshift, comparing models with observational data. Each panel corresponds to a 
different redshift, as indicated by the legend. The curves show model predictions, while the lines with symbols and symbols with error bars show observational 
data. The solid curves show the predictions for the total luminosity function for our standard model, with a top-heavy IMF in bursts and / csc = 0.02, while 
the dotted and short-dashed lines show the separate contributions of starbursts and quiescently star-forming galaxies respectively (in most cases the solid and 
dotted lines overlap). The long-dashed curves show the predicted total luminosity function for a variant model with a universal Kennicutt IMF an d f eBC = 0.2. 
The re ferences f or the observational data (as shown in the symb ol key) a re as follows: Kudri tzki -fKudritzki et al. i2000); Cowi e & Hu - IC owie & Hul 
11998ft: CADIS -iMaier et al. 1 120031): Ouchi -lOuchi et al\ 120031) : Santos - ISantos et aU |2004|) : Aiiki - lAiiki et al. I l2003l) ~Hu - iHu et al. 1 12004|) : LAL~ 
Rhoads et al. I fcOOfl : Taniguclu TTanTgucniT?a^l2l)0^K The redshifts for the observational data are close to that of the model plotted in each panel, but do 
not exactly coincide in all cases. In most cases, the data are plotted as stepped histograms, with each step corresponding to a single galaxy. 



predictions for the cumulative number counts of emitters per unit 
redshift as a function of observed Lya flux. Here we focus on a 
closely related quantity, the cumulative space density of emitters as 
a function of Lya luminosity at different redshifts. FigQshows the 
cumulative luminosity function of Lya emitters predicted by our 
standard model for a set of redshifts over the interval z = — 7. The 
model luminosity function initially gets brighter with increasing 
redshift, peaking at z = 3, before declining again in number den- 
sity at even higher redshifts. The increase in the luminosity function 
from z — to z ~ 3 is driven both by the increase in galaxy star 



formation rates, and by the increasing fraction of star formation oc- 
curin g in bursts (which have a top-heavy IMF). As shown in Fig.l 
in lBaugh et al. I 120051) . the model predicts that the fraction of all 
star formation occuring in bursts increases from ~ 5% at z — to 
50% at 2 ~ 3.5 and then to ~ 80% at z > 6. 

We compare the model predictions with observational esti- 
mates of the cumulative Lya luminosity function in Fig|2| where 
we show different redshifts in different panels. The observational 
estimates of the luminos i ty fun ctions which we plot have been 
calculated by iTran et al. I J2004) (and also S. Lilly, private com- 



The properties of Ly a- emitting galaxies in hierarchical galaxy formation models 5 



munication, for z = 6.55) from published data on surveys for 
LAEs, assuming the same cosmology as we assume in our mod- 
els. The observed luminosity functions in the figures are labelled 
according to the survey from which they were obtained. In each 
case, surveys using narrow-band filters were used to find can- 
didates for LAEs at particular redshifts, and then either broad- 
band colours or follow-up spectroscopy were used to determine 
which of the candidates were likely to be real Lya emitters, 
and which were likely to be lower-redshift interlopers resulting 
from other emission lines falling within the narrow-band filter re- 
sp onse. The surveys with spectroscopic follow - up which we plo t 
arelKudritzki et al. IfcOOOh . lCowie & HulJl998t).lHu et al. N2004 . 
Rhoa ds et al. I 120031 LALA) and iTaniguchi et al. I <2005l) . while 
the surv eys using only col our selection a re Maiere/a/. 1 2003 , 
CADIS) Jouchi el ■ aZlbOOft andlAiiki et al. N2003h . We also show 
the data of Tslmto^razT72004l) at z w 5 from a spectroscopic 
survey of gravitationally-lensed fields. The stepped appearance of 
most of the observed luminosity functions results from the small 
number of objects in most of the samples; each step corresponds to 
the inclusion of an additional object as the luminosity is reduced. 
The cumulative luminosity functions cut off at the bright end where 
the observational samples contain only one object of that luminos- 
ity; the statistical uncertainties are correspondingly largest at the 
highest luminosities. For reference, we note that the luminosity dis- 
tance for our assumed cosmology is cLl = (2.0, 3.2, 3.8,4.5) x 
10 4 /i -1 Mpc for z = 3.3,4.9,5.7,6.55 respectively, so that a 
Lya flux of 10 _17 ergcm _2 s _1 at each of these redshifts cor- 
responds to a luminosity (0.5,1.2,1.7,2.4) x 10 42 /i~ 2 erg s -1 
respectively. The lower luminosity limits on the observed lumi- 
nosity functions correspond to roughly the same Lya flux limit 
~ 10~ 17 erg cm _2 s _1 at each redshift. 

In Fig|2| the predictions for our standard model (with a top- 
heavy IMF in bursts and / osc = 0.02) are shown by solid lines. We 
also show the separate contributions of bursting and quiescently 
star-forming galaxies respectively as dotted and short-dashed lines. 
In most cases, the dotted line is barely distinguishable from the 
solid line, showing that the model Lya luminosity function is com- 
pletely dominated by bursts over the range of redshift and luminos- 
ity plotted in Fig|2| Overall, there is broad agreement between the 
predicted and observed luminosity functions over the redshift range 
z = 3 — 6.6. This is remarkable, since we allowed ourselves to ad- 
just only one model parameter to fit the observational data, namely 
the Lya escape fraction / csc . The model luminosity functions do 
not perfectly match all of the observational data, but where there 
are differences between the model and observational data, there 
are also equally large differences between different observational 
datasets. The differences between different observational datasets 
could be due to a combination of (a) statistical fluctuations (most 
of the samples are small), (b) field-to- field variance due to galaxy 
clustering (e.g. Ouchi et al 2003; Shimasa ku et al. 120031) . (c) dif- 
ferences in the details of how the samples are selected (e.g. differ- 
ences in the equivalent width limit or photometric criteria applied), 
and (d) differing levels of contamination by objects which are not 
Lya emitters. We note that the model predictions shown in Fig|2| 
do not include any limit on the equivalent width (EW) of the Lya 
emission line, while the observational data shown all incorporate 
different lower limits on the EW of line emission as well as on the 
line flux. However, as we show in £14.11 these EW thresholds are 
predicted not to significantly affect the comparison of model and 
observed luminosity functions in Fig|2| 

The value of the Lya escape fraction which we find fits the 
data, / csc = 0.02, is quite small. This is mainly because, over the 



range of redshift and luminosity probed by the observations, the 
counts of objects in our standard model are dominated by bursts, 
and we have assumed a top-heavy IMF in bursts. As noted above, 
the Lya luminosity for a given star formation rate is about 10 
times larger with the top-heavy IMF than with a solar neighbour- 
hood IMF. We also show in Fig|2|by long-dashed lines the predic- 
tions of a variant model, in which we assume the same Kennicutt 
IMF for bursts and quiescent star formation, and with / csc = 0.2. 
Even though we have chosen / osc for this variant model to pro- 
vide the best overall match to the observational data in Fig|2| we 
see that it agrees somewhat less well with the data than does our 
standard model, especially at z = 3.3, where it predicts more 
low-luminosity galaxies. Moreover, this variant model dramatically 
under-predicts both the counts of sub-mm galax ies and the nu mber 
of Lyman-break galaxies (see Fig. 5(a) in Baug hef al. 1120051) ). 

We have also investigated the effect on the predicted lumi- 
nosity functions of changing the IGM reionization redshift from 
our standard value Ztelon = 10. As described in Sj2| this affects 
galaxies in our model through photo-ionization feedback. For our 
standard model, we find that varying z rc ion over the range 6.5 to 
20 changes the luminosity function by less than the scatter between 
different observational datasets in Fig|2| over the range of luminos- 
ity and redshift probed by those data. Choosing a different value of 
Zrcion in this range would therefore not significantly affect any of 
the conclusions we draw her e. 

iFurlanetto et al. I l2005f) have computed luminosity functions 
of Lya emitters from a numerical simulation, including emission 
from gas heated by shocks and by the intergalactic ionizing back- 
ground as well as emission from star-forming regions in galaxies. 
They assume that stars all form with a Salpeter IMF. However, 
the luminosity functions which they compute combine all of the 
emission from each dark matter halo, and so are different from 
the luminosity functions of individual galaxies which our model 
predicts. Furthermore, they effectively assume an escape fraction 
/csc = 1 for Lya emission from star formation. They do not make 
any detailed comparison with observational data on Lya-emitting 
galaxies, but note that their luminosity functions predict roughly an 
order-of-magnitude more objects than are observed over the range 
Lhya ~ 10 42 — 10 43 erg s _1 . This is roughly consistent with what 
we would find if we assumed a Kennicutt IMF for all star formation 
and /csc = 1. 



4 OBSERVABLE PROPERTIES OF LYa EMITTERS 

Now that we have established that our model gives a very good 
match to the luminosity function of Lya emitters at different red- 
shifts, we turn our attention to other observable properties of these 
objects. We first present predictions for the distribution of Lya 
equivalent widths (§4.1), before examining the broad-band contin- 
uum magnitudes of Lya emitters (§4.2) and, finally, the size distri- 
bution of emitters (§4.3). 

4.1 Lya equivalent widths 

Our model allows a simple prediction for the equivalent width 
(EW) of the Lya emission line in each galaxy: we divide the lumi- 
nosity in the emission line by the mean luminosity per unit wave- 
length of the stellar continuum on either side of the line. We distin- 
guish between the net and intrinsic line and stellar luminosities and 
equivalent widths. The net values are obtained after we multiply the 
Lya luminosity by the escape fraction / CS c and after we attenuate 
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the stellar luminosity by dust extinction, while the intrinsic values 
are those before we include either the Lya escape fraction or dust 
attenuation. A limitation of our current model is that it does not 
include the effects of absorption of Lya, so the equivalent widths 
we calculate are always positive (or zero). Lya absorption features 
(corresponding to negative equivalent widths) could be produced 
either by absorption in stellar atmospheres ICharlot & Fall 1 19931) . 
or by neutral gas within the galaxy or in an expanding shell or wind 
around it fTenorio-Tagle et al. II 19 99). Our calculations are there- 
fore incomplete, but nevertheless represent an important first step. 

Fig|3| shows the model predictions for rest-frame equivalent 
widths of Lya-emitting galaxies at z=3. The most remarkable fea- 
ture of these plots is the wide spread of EWs predicted by the 
model. This is seen most clearly in the middle panel, which shows 
the distribution of equivalent widths for galaxies selected to have 
Lya fluxes in the range 10~ 17 < / < 10 _16 erg cm~ 2 s _1 . We see 
that there is a big difference between the distributions of intrinsic 
and net EWs (shown by dashed and solid lines respectively). For 
this flux range, the intrinsic rest-frame EW has a median value of 
130 A, and most galaxies have EWs in the range 100-200A. These 
values are similar to the predictions o f lCharlot & Falll <1993l) . The 
spread in intrinsic EWs results mostly from the spread in burst ages 
and timescales. For the same galaxies, the net EWs have a much 
lower median value, 33A, but with a much broader distribution, 
with a peak close to and a tail extending up to ~ 400A. Since 
our model assumes that all galaxies have the same escape fraction 
for Lya, this broad distribution of net EWs results from the wide 
spread in values of dust extinction for the stellar continuum. The 
Lya escape fraction reduces the net EW relative to the intrinsic 
value, but dust extinction of the stellar continuum increases it. 

The upper panel of Fig|5| shows the median and 10-90 per- 
centile range for the EW of Lya as a function of the net Lya flux. 
There is a weak trend of EW increasing with Lya flux (or luminos- 
ity). In the case of the intrinsic EW, this increase is driven mostly by 
the shift from being dominated by quiescently star-forming galax- 
ies (with a normal IMF) at low luminosities to being dominated 
by bursts (with a top-heavy IMF) at high luminosities, and by the 
change in the typical star formation history. For the net EW, the 
increase in the median is driven also by the increase in the typical 
dust extinction of the stellar continuum with increasing luminosity. 

The Lya equivalent widths predicted by our model are sim- 
ilar to those f ound in observed ga laxy samples selected by t heir 
Lya emission. lCowie & Hu1<1998l) and lKudritzki et al. 1 1200(3) se- 
lected LAEs having Lya fluxes ~ 10 -17 — 10 _16 erg cm _2 s _1 
at z — 3.4 and z = 3.1 respectively. In both cases, their narrow- 
band selection imposed a lower limit on the rest-frame EW « 20A 
for the detected objects, and the median rest-frame EW of the ob- 
jects above this threshold was found to be ~ 40A. This appears 
broadly compatable with the predictions shown in Fig|5| once one 
allows for the fact that the EW threshold in the observed samples 
will raise the median EW above the value expected in the absence 
of any EW threshold. At a higher redshift, z ~ 4.5. lDawson et al\ 
(2004) selected LAEs with / - 10~ 17 - 10" 16 erg cm" 2 s _1 and 
EW(rest) > 15A, and measured a median .EW^rest) « 80A for 
their sample. This is also in good agreement with our model, which 
predicts a median EW(rest) as 80A for LAEs with 10" 17 < / < 
10~ 16 ergcm~ 2 s~ 1 at this redshift. 

Shaple vef al. 1 12003}) have measured Lya emission and ab- 
sorption profiles and EWs in a sample of galaxies at z ~ 3 se- 
lected using the Lyman-break technique. Their sample is thus se- 
lected on rest-frame far-UV stellar continuum luminosity, rather 
than on the presence of a strong Lya emission line. They find that 
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Figure 3. The predicted rest-frame equivalent width (EW) of the Lya 
emission line for galaxies at 2 3. We show results both for the intrin- 
sic EW, i.e. before including attenuation by neutral gas and dust in the 
galaxy, and for the net EW, i.e. after including the escape fraction for Lya 
photons and dust extinction of the stellar continuum - these are shown by 
dashed and solid lines respectively. In either case, the EW is considered 
as a function of the net Lya flux, (a) The upper panel shows the predicted 
median EW as a function of the net Lya flux. The error bars show the 
10-90 percentile range at a given flux, (b) The middle panel shows the pre- 
dicted distribution of EWs for galaxies with net Lya fluxes in the range 
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(c) The lower panel shows the predicted 



EW distribution for galaxies selected to have continuum magnitudes in the 
range 23 < R A B < 25.5. 
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Figure 4. The effect of different equivalent width thresholds on the pre- 
dicted luminosity function of Lya emitters. In each panel, the lines show 
the predicted cumulative luminosity function for different lower limits on 
the rest-frame EW of Lya emission: EW> OA (solid curves), EW> 2fjA 
(dotted), EW> 4fjA (short-dashed), and EW> 100A (long-dashed), (a) 
2 = 3. (b) 2 = 6. 



~ 30% of their galaxies show Lya only in emission, ~ 30% of 
galaxies show Lya only in absorption, and ~ 40% show a com- 
bination of Lya absorption and emission. They find a very asym- 
metric and skewed distribution of Lya rest-frame EW's, with a me- 
dian close to OA, extending down to ~ — 50A for net absorption 
and to > 100A for net emission. For the galaxies with net Lya 
emission, the median EW is ~ 20A. (For galaxies with net absorp- 
tion, the median EW is ~ — 20A.) The lower panel of Fig|3|shows 
the EW distribution predicted by the model if we select galaxies 
in a similar way to IShaplev et al. L with a continuum magnitude 
range 23 < Rab < 25.5 (including dust extinction) and no con- 
dition on the Lya flux or EW. The model predicts a median EW 



« 20A for this case, very simi lar to the typic al EW of the emis- 
sion component of Lya in the IShaplev et al. I sample. The shape 
of the model EW distribution (w hich is restricte d to EW ^ 0) is 
also very similar to that found by Sha plev et al. I for EW > (see 
their Fig. 8). However, without including a calculation of Lya ab- 
sorp tion in our mo del, we cannot make a more detailed comparison 
wifh lShaplev et al\ Since a calculation of Lya absorption requires 
a treatment of radiative transfer through the galaxy ISM, we defer 
this to a future paper. 

Since our model allows us to estimate Lya EWs, we can 
also estimate the effect on the Lya luminosity function of im- 
posing different lower limits on the EWs of Lya emission from 
galaxies. This is shown in Fig|4| for rest-frame EW thresholds 
EW mln (rest) = 0,20,40 and 100A, for redshifts z = 3 and 
2 = 6. Different observational surveys for LAEs impose differ- 
ent lower limits on the EWs of the objects they include. However, 
for most of the observational data plotted in Fig|2| the lower limit 
is around i?W / m i n (rest) = 20A. We see from Fig|4]that an EW 
threshold around this value is predicted to have only a small ef- 
fect on the Lya luminosity function, so the conclusions we drew 
from the comparison with observational data in Fig|2|would not be 
significantly affected. 

4.2 Broad-band magnitudes 

Another important test for our model of the Lya emitters is that it 
should predict the correct stellar continuum as measured in broad- 
band filters. Fig|5| shows the model predictions for the median 
broad-band magnitudes as a function of Lya flux at three differ- 
ent redshifts, z = 3, 5.7 and 6.55, and for three different broad- 
band filters, the I c , i' and z' filters on the Suprime Cam on the 
Subaru Telescope. (We chose these particular filters because most 
of the observational data we will compare with were taken with 
them.) The predicted broad-band magnitudes include the effects of 
du st extinction a nd of attenuation by the intervening IGM (based 
on Madau 1995). The evolution with redshift of the predicted I c 
and i' magnitudes at a given Lya flux which is seen in Fig[5]results 
mostly from the IGM opacity. In some cases, the Lya emission 
line falls within the bandpass of the filter. We have therefore com- 
puted broad-band magnitudes due to either the stellar continuum 
only (shown by dashed lines), or to the stellar continuum and Lya 
emission line together (shown by solid lines). In most of the cases 
plotted in Fig|5| the solid and dashed lines are indistinguishable, 
but in a few cases there is a small offset, showing that the Lya line 
makes a modest contribution to the broad-band magnitude in these 
cases. 

For comparison, we also plot in Fig[5] a selection of obser- 
vatio nal data for galax i es at z ~ 5 — 7 f rom the following pa- 
pers :lEllis et al. IfeOOlli.lAiiki et al. I <2003l) . IStanwav et al. 1 1 2004) 
and Tanigu chieTaTlTzOOSl) . The data we plot constrain the stel- 
lar continu um at wave len gths ~ 900 — 14 00A in the galaxy rest- 
frame. The Elli s et al. l and lStanwav et al. I data were actually taken 
on HST using the WFPC2 (Igi 4 filter) and ACS (i' and z filters) 
cameras respectively, but we have verified that the difference of 
these filters from the Subaru filters (in particular, the difference of 
7814 from I c ) does not significantly affect the comparison of mod- 
els with data which we make here. In cases where the observational 
papers have tried to correct the broad-band magnitudes for the con- 
tribution from the Lya line, we have plotted the total magnitude 
before this correction was made. We plot the observational data in 
Fig|5|as symbols of th e same co lour as the model curve closest in 
redshift. Apart from lEllis et al. I all of the samples contain more 
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Figure 5. Broad-band magnitudes as a function of Lya flux. The lines show 
the model predictions for three different redshifts, z = 3, 5.7 and 6.55, in 
different colours. The dashed lines show the median magnitude due to the 
stellar continuum only, while the solid lines include also the contribution 
of the Lya line to the broad-band magnitude. Where the dashed line is not 
visible, it coincides with the solid line. The error bars on the lines show 
the 10-90 percentile range. For clarity, small offsets in the x-direction have 
been applied to the model relations for different redshifts. The top, mid- 
dle and lower panels show results for the I c , i' and z' filters respectively. 
The symbols show observational data, plotted in the same colours as the 
mo del curve closest in redshift. The observational data are as follows: El- 
lis - lEllis et al. IfeOOll) (1 galaxy ); Ajiki - lAiiki et al. 1120031) (20 galaxies'): 
Starrway~Tstorwav^rQ/n72004l) (3 galaxies); Taniguchi - ITaniguchi et al\ 



1 2005) (9 galaxies). For samples with > 1 galaxy, we plot an estimate of 
the median Lya flux and broad-band magnitude, and of the 10-90 percentile 
ranges in both (shown by error bars). 



than one galaxy, and in these cases we estimate the median and 
10-90% percentile range for both the Lya flux and the broad-band 
magnitude (allowing for upper limits on the broad-band fluxes). We 
plot the symbol at the median value and show the 10-90% range by 
error bars. If the 10% value is an upper limit, we indicate this by a 
very long downwards error bar. 

We see from Fig[5] that there is mostly good agreement be- 
tween the predicted broad-band magnitudes and the observational 
data. The one exception is th at the median i' magnitude measured 
from ITaniguchi et al. 1 120051) for galaxies at z = 6.55 is nearly 
3 magnitudes brighter than what our model predicts, even though 
the z' magnitudes for the same observational sample agree very 
well with the model predictions. However, 7 out of 9 objects in the 
ITaniguchi et al. I sample are detected in the i' band at less than 2a 
significance, so it is possible that the symbol marking our estimate 
of their median magnitude is biased high by statistical errors. We 
note that at redshift z = 6.55, the i'-band flux is sensitive to emis- 
sion at wavelengths ~ 900 — HOOA in the galaxy rest-frame, so 
it is expected to be greatly attenuated by Lya absorption by neu- 
tral hydrogen in the intervening IGM. In contrast, the flux in the z 
band at this redshift is expected to be much less affected by IGM 
attenuation. Thus, an alternative possible explanation for the dis- 
agreement between the model and the ITaniguchi et al. I i' data is 
that we over-estimate the degree of IGM attenuation at this redshift 
when we calculate it using the M adaulJl993) formula. 



4.3 Sizes of Lya emitters 

Our semi-analytical model predicts the half-mass radii for the disk 
and bulge components of each galaxy. From these we can compute 
the half-mass radius of the stars, and also half-light radii in differ- 
ent bands, allowing for different colours of the disk and bulge, but 
assuming that both components have internally uniform colours. 
Coenda et al. (2005, in preparation) will present predictions from 
our model for the sizes of galaxies selected by their stellar contin- 
uum emission, and compar e with obse rvational data over the red- 
shift range < z < 6. ICole et al \ l2000h have discussed pre- 
dictions for galaxy sizes at z = based on an earlier version of 
our semi-analytical model. In the present paper, we will only con- 
sider the sizes of galaxies selected to be Lya emitters. We empha- 
size that w e are not con sidering in this paper the properties of Lya 
blobs (e.g. Mats uda et al. lE004l) . which are much more spatially 
extended than typical Lya emitters, and appear to be a distinct class 
of object. 

In Fig[6ja), we show model predictions for the median stel- 
lar half-mass radius (together with its 10-90 percentile range) as a 
function of Lya luminosity for several different redshifts, z — 3, 
5, 6 and 7. We have also calculated model half-light radii in the 
rest-frame UV, and the results are almost identical to those for 
the half-mass radius for these redshifts and luminosities. The stel- 
lar sizes are predicted to be quite compact at these redshifts, < 
l/i _1 kpc. We see that, as well as a correlation of size with luminos- 
ity (roughly as R1/2 oc L 1//3 ), the models also predict that the me- 
dian radius at a given luminosity should decrease with increasing 
redshift (roughly as (l + z)" 1 or (l + z)~ 1,5 ). At a fixed Lya lumi- 
nosity of 10 42 ' 5 ft _2 erg s _1 (the typical value in the higher-redshift 
surveys shown in Fig|2j, the median half mass radius shrinks from 
~ lft _1 kpc at z — 3 to ~ 0.5/i _1 kpc at z — 7. 

We have also plotted in Fig|6ja) some observational estimates 
of sizes for individual Lya-emitting galaxies at z ~ 5 — 7, plot- 
ted in the same colour as the model curve clo sest in redshift, for 
three different samples: IStanwav et al. I 120041) used HST to mea- 
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sure h alf-light radii in the rest-frame UV of 3 LAEs: lEUis et al. I 
1200 ll) used HST to measure the rest-fra me UV size of a single 
strongly gravitationally lensed LAE; and ITaniguchi et al\ 12005) 
used ground-based narrow-band imaging to estimate the sizes of 
the Ly q-emitting region in 9 LAEs. We see that the St anwav et al. I 
12004) data agree very well with our model predictions, but the 
Ellis et al. ( 200]]) galaxy is much smaller than the median pre- 
dic ted by our model at that luminosity and redshift. However, 



the Elli^^/. 



IStanwav et al. 



datapoint is more uncertain than those of 
because it relies on the analysis of a highly 
gravitationally amplified and distorted image. We also see that the 
sizes of the Lya-emitting regions estimated by Taniguchi et al. 
(2005) are typically ~ 2 times larger than the model prediction 
for the stellar half-mass radius at the same luminosity. This might 
be because the Lya emission in these high-redshift LAEs really is 
more extended than the stellar distribution. This has been found to 
be the case in some local starburst galaxies by Mas-Hess e et al. I 
(2003), who explain this as resulting from scattering of Lya by 
neutral gas aroun d these galaxies. Alternatively, it is possible that 
iTaniguchi et al. I have over-estimated the sizes of their galaxies, 
which are barely spatially resolved in their ground-based images. 

In Fig[(Jb) we show predictions for the angular sizes of Lya- 
emitters as a function of Lya flux, for redshifts over the range 
z = 3 — 20. (We again use the stellar half-mass radius as our 
measure of the size.) We see that the relation between angular size 
and flux evolves rather little with redshift, even though the relation 
between physical size and luminosity does evolve appreciably. Pre- 
dicted angular sizes are typically ~ 0.1 arcseconds for fluxes in the 
observed range. 



5 PREDICTED PHYSICAL PROPERTIES OF LYa 
EMITTERS 

One of the main strengths of semi-analytical modelling lies in the 
ability of the models to make predictions for a wide range of galaxy 
properties. Some of these predictions can be tested directly against 
observations, as we saw in the previous section. Others can be 
tested indirectly, for example through the interpretation of mea- 
surements of clustering. Finally, some predictions serve to illustrate 
how a subset of galaxies highlighted by a particular observational 
selection fit into the overall galaxy population. In this section we 
present some additional predictions of the model that characterize 
the Lya emitters. 

Fig0 shows model predictions for different physical proper- 
ties as a function of Lya luminosity. In each panel, we show the 
predictions at z = 3, 5 and 7. We plot the median value of the re- 
spective quantity and indicate the spread in the predicted values by 
showing the 10-90 percentile range, apart from the plot of cluster- 
ing bias, where we show only the mean value. 
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Figure 6. The sizes of Lya emitters, (a) The top panel shows the physical 
sizes of Lya emitters as a function of their Lya luminosities. The lines show 
model predictions for the median stellar half-mass radius for four different 
redshifts, z = 3, 5, 6 and 7. The error bars show the 10-90 percentile range 
at a given luminos ity. The sym bols show obser vation al es tima tes of galaxy 
radii, f rom rEllis et al. I l200ll) . IStanwav et al.\ <2004 and ITaniguchi et al. I 
(2005), plotted for individual galaxies in the same colours as the model 
curve closest in redshift (see text for more details), (b) The lower panel 
shows predicted angular sizes as a function of Lya flux, for the redshifts 
z = 3,6, 8, 10, 15 and 20. The sizes plotted are again stellar half-mass 
radii. 
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5.1 Halo masses 

The upper left panel of Fig0shows the masses of the dark matter 
halos hosting Lya emitters. We see that at z = 3, there is only a 
weak dependence of median halo mass on Lya luminosity, while 
at z = 7, the dependence is much stronger. At a given luminos- 
ity, the typical halo mass decreases with increasing redshift, with 
this trend being stronger at lower luminosities. As discussed in 
' \5. 51 below, the Lya luminosity traces the instantaneous star for- 
mation rate (SFR) quite well in our model, but with a ratio which 
is ~ 10 times larger for bursting compared to quiescent galaxies, 
because of the difference in IMFs. There are two main reasons for 
the weak dependence of halo mass on Lya luminosity at the lower 
redshifts: (a) the bursts introduce a large scatter into the relation 
between instantaneous SFR and object mass, especially at lower 
redshifts; (b) the shift from being dominated by bursts at high Lya 
luminosities to being dominated by quiescent disks at lower lumi- 
nosities flattens the trend of SFR with Lya luminosity, which tends 
to hide the underlying trend of halo mass with SFR. Current Lya 
surveys probe objects with luminosities ~ 10 42 ' 5 /i _2 ergs _1 over 
the whole redshift range z ~ 3 — 7, for which the typical halo mass 
~ 10 11 ft -1 Mq, declining by a factor ~ 3 from z = 3 to z — 7. 

5.2 Clustering bias 

The halo masses of LAEs can be constrained observationally from 
measurements of their clustering. Since the predicted halo masses 
for typical observed LAEs are larger than the characteristic halo 
mass at each redshift, we expect the LAEs to be more strongly 
clustered than the dark matter. We have used the halo masses pre- 
dicted by the model to calculate the linear clustering bias b, which 
is expected to describe the clustering o n large scales, using the for- 
mula of ISheth. Mo & Tormenl 1200 ll) . We calculate a mean bias 
for objects in each range of luminosity. This mean bias is shown in 
the upper right panel of FigQ Over the luminosity range L^ya = 
10 40 — 10 43 /i _2 erg s _1 , our model predicts that the bias increases 
with redshift at fixed luminosity. At z — 3, we predict b ~ 2 over 
this luminosity range, increasing only very slightly with luminos- 
ity. At z = 7, the bias is predicted to vary much more strongly 
with luminosity, from b « 3 at Lh ya = 10 40 /i _2 erg s _1 to b « 6 
at 10 43 /i -2 erg s 1 . These predictions for the clustering of LAEs 
seem generally consistent with current observational constraints. 
The most reliable measurement of the clustering of LAEs to date 
is probably that of O uchi et al. I (2005), since their sample covers 
by far the largest comoving volume. For galaxies at z — 5.7 with 
Lya luminosities ~ 10 42 /i _2 erg s _1 , they find a large-scale clus- 
tering bias b — 3.4 ± 1.8. For the same redshift and luminosity, our 
model predicts b ~ 4, in excellent agreement with this measure- 
ment. At somewhat lower redshifts, z ~ 4.8, somewhat conflicting 
results have been obtained for the clustering (e.g. Ouchi et al 2003 ; 
IShimasaku et al. 2003, 2004 ; H amana et al. 120041) . however, these 
have been o btained fro m muc h smaller survey volumes. In partic- 
ular, [shimasaki^nji] 120041) measure very different clustering in 
their two approximately equal survey volumes at z — 4.8, showing 
that the surveys used were not large enough to reliably measure the 
average clustering. 

5.3 Stellar masses 

The middle left panel of FigQshows the predicted stellar masses of 
LAEs as a function of Lya luminosity. The trends of stellar mass 
with luminosity and redshift are similar to those already discussed 



for the halo mass: the trend of stellar mass with luminosity steep- 
ens with increasing redshift, and the mass at a given luminosity 
decreases with increasing redshift. The reasons for the rather flat 
trend of stellar mass with Lya luminosity at the lower redshifts are 
the same as those already given for the halo mass. For a luminosity 
Lhya = 10 42 ' 5 /i _2 erg s _1 , the median stellar mass is predicted to 
decrease from ~ W 9 h~ 1 M Q at z = 3 to ~ 3 x W 7 h~ 1 M Q at 
z = 7. 

There have not yet been any observational estimates of the 
stellar masses of LAEs, but the values predicted by our model 
are rather lower than the stellar masses inferred observationally 
for so me other classes of high-redshift galaxies. Shaple v et al. I 
(2001) estimated stellar masses of Lyman-break galaxies at z ~ 3 
from broad-band photometry, and found a median stellar mass ~ 
in a sample with median magnitud e Rab ~ 24 (sim- 
ilar r esults were also found by IPapovich. Dickinson & Ferguson! 
l200ll) . In contrast, typical observed LAEs at z ~ 3 with Lhya ~ 
10 42 fo -2 erg s _1 are predicted by our model to have stellar masses 
~ 10 9 h -1 Mq. The difference could be explained by a combi- 
nation of two effects: (a) The LAEs at z ~ 3 with Ll V u ~ 
10 42 /i~ 2 erg s^ 1 t ypically have fa inter continuum magnitudes (by 
1-2 mag) than the Shaplev et al. LBGs. (b) The photometric esti- 
mates of the stellar masses of LBGs depend strongly on the IMF 
assumed, because the ma ss-to-light ratio of a stellar population is 
very sensitive to the IMF; Shapl ev et al. I assumed a Salpeter IMF, 
but if instead they had assumed a top-heavy IMF as in starbursts in 
our model, then they might have derived lower masses. 

The issue of how photometric estimates of the stellar masses 
of high-redshift galaxies depend on the assumed IMF is an im- 
portant one, but is also complicated, because these estimates in- 
volve fitting multi-parameter models (varying age, star formation 
history, metallicity and dust extinction) to multi-band photometric 
data, in order to estimate the mass-to-light ratio of the stellar pop- 
ulatio n. Most such studies have simply assumed a Salpeter IMF. 
IPapovich. Dickinson & Ferguson 1 1200 ll) considered the effects on 
photometric mass estimates of varying the lower mass limit on the 
IMF, but did not consider IMF slopes different from the solar neigh- 
bourhood value. We plan to address this issue in more detail in a 
future paper. 

5.4 Metallicities 

The middle right panel of Fig0 shows the metallicity of the cold 
gas in LAEs as a function of Lya luminosity. We see that in most 
cases, the gas metallicity is appreciable, ~ 10 -2 (i.e. comparable 
to solar), even at high redshifts. This reflects the fact that galax- 
ies are able to self-enrich to metallicities ~ Zq even when the 
mean metallicity of all baryons in the universe is much lower than 
this. In our model, the quiescent galaxies are predicted to show 
a well-defined trend of metallicity increasi ng with lu minosity (as 
already found for z = galaxies bv lCole et al. l200d) . which pro- 
duces the decline in metallicity at low luminosities seen in Fig0 
for 2 = 3 and z = 5. However, the bursts show a more com- 
plicated behaviour, with the median metallicity being flat or even 
non-monotonic with luminosity, which is reflected in the behaviour 
seen in Fig0for z — 7 and for higher luminosities at z — 3 and 

2 = 5.^ 

5.5 Star formation rates 

The lower left panel of FigQ shows the instantaneous star forma- 
tion rates in LAEs as a function of Lya luminosity. The star for- 
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Figure 7. Model predictions for a range of physical properties of Lyc* emitters plotted as a function of Lyc* luminosity. In each panel the model predictions 
are shown for z = 3, 5 and 7. The lines show the median values of the respective properties (apart from panel (b) which shows the mean) and the error bars 
show the 10-90 percentile range, (a) The upper left panel shows the mass of the dark matter halo hosting the emitter, (b) The upper right panel shows the mean 
clustering bias, (c) The middlle left panel shows the stellar mass, (d) The middle right panel shows the metallicity of the cold gas. (e) The lower left panel 
shows the total star formation rate, (f) The lower right panel shows the UV continuum luminosity at a rest-frame wavelength of 1500A. 
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mation rates include both the quiescent star formation in galactic 
disks and the contribution of any ongoing starbursts. If we look at 
either quiescent galaxies or bursts separately, then we find a nearly 
linear relation SFR oc LL ya , but with a proportionality constant 
which is ~ 10 times larger for the quiescent galaxies, because 
of the difference in IMFs. For a galaxy which has been forming 
stars at a contant rate for 10 8 yr, we predict a relation Lh ya = 
(1.1,12) x 10 42 ergs _1 fes^SFR/M^yr' 1 ) for the Kennicutt 
and x = IMFs respectively (for solar metallicity), and quies- 
cent and bursting galaxies in our model separately lie quite close 
to one or other relation. In Fig0we see a shallower relation than 
SFR oc LLya in some cases, which results from a gradual transi- 
tion from being dominated by bursts at high luminosities to being 
dominated by quiescently star-forming galaxies at low luminosi- 
ties. For LAEs with luminosities ~ 10 42 ' 5 /i _2 erg s _1 (which are 
dominated by bursts), our model predicts SFRs ~ 10h~ M@yr~ , 
We note that comparing the SFRs predicted by our model with pub- 
lished values estimated from observational data is not straightfor- 
ward, because different authors (a) assume different values for the 
Lya escape fraction (often taking / csc = 1), and (b) assume differ- 
ent IMFs (typically using a Salpeter IMF). 

5.6 UV continuum luminosities 

The lower right panel of Fig0 shows the UV continuum lumi- 
nosity Z/„(1500A) at a fixed rest-frame wavelength of 1500A as 
a function of Lya luminosity. This plot contains similar informa- 
tion to Fig[5] but now, for convenience, in terms of luminosities, 
and at a fixed rest-frame wavelength in the UV. The predicted re- 
lation between UV and Lya luminosities is roughly linear. This is 
what one would expect if one had a universal IMF and no dust at- 
tenuation, since both the UV and Lya luminosities are driven by 
recent star formation. For a galaxy which has been forming stars 
at a contant rate for 10 8 yr, we predict an unattenuated relation 
L„(1500l) = (0.98,3.8) x lO^ergs^Hz-^Si^R/Mayr -1 ) 
for the Kennicutt and x — IMFs respectively, for solar metallic- 
ity. However, dust extinction is predicted to have a large effect on 
UV luminosities in our model, reducing them by a factor ~ 10 in 
the brighter objects. The average UV extinction in the models in- 
creases with luminosity in both bursts and quiescent galaxies, and 
is also larger in quiescent than bursting objects (at a given SFR). 
The unattenuated UV/Lya luminosity ratio is also predicted to de- 
crease by a factor 3 going from quiescent objects at low luminosity 
to bursts at high luminosity as a result of the change in the IMF. 
These effects all combine to leave a relation between UV and Lya 
luminosities which is shifted but still roughly linear. 

The UV and Lya luminosities are both used in estimating star 
formation rates for observed high-redshift galaxies. However, both 
suffer from the drawback that they are affected by large but un- 
certain dust attenuation factors. In the model presented here, the 
dust attenuation is larger by a factor ~ 10 for the Lya than for the 
UV luminosity, so by that criterion, the UV luminosity should be 
a more reliable quantitative star formation indicator. However, we 
caution that the Lya attenuation factor / csc which we use was not 
obtained from a detailed radiative transfer calculation, unlike the 
UV dust extinction. 

6 THE ABUNDANCE OF LYa EMITTERS AT Z > 7 

Paper I presented predictions for the abundance of Lya emitters as 
a function of flux for the redshift range 2 < z < 7 which is acces- 
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Figure 8. Predictions for the number of Lya emitters at very high redshifts. 
(a) The upper panel shows the evolution of the cumulative Lya luminosity 
function at 6 $C z sC 20, for our standard model, with reionization redshift 
2 rcion = 10 an d Lya escape fraction / csc = 0.02. (b) The middle panel 
shows the predicted number counts per unit redshift interval per unit solid 
angle as a cumulative function of the Lya flux, for z r cion = 10. Results are 
shown for two different escape fractions, our standard value / csc = 0.02 
(solid lines) and a larger value / csc = 0.2 (dashed lines). We show number 
counts for selected redshifts falling within either the J,H or K atmospheric 
transmission windows, with different redshifts in different colours, (c) The 
lower panel shows the cumulative number counts as in (b), for our standard 
fesc = 0.02, and three different reionization redshifts, z r cion = 6.5 (dot- 
ted lines), 10 (solid lines) and 20 (dashed lines); where the dotted lines are 
not visible, they coincide with the solid lines. The number counts do not 
include attenuation by the IGM. 
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sible from observations at optical wavelengths. The highest redshift 
at which LAEs have been found in surveys up to now is z = 6.6. 
At even higher redshifts, the Lya line moves into the near-IR as 
seen from the Earth. Therefore, searching for LAEs at z > 7 re- 
quires observing in the near-IR, which is technically challenging. 
Several such search es are underway ("e.g. lWillis & Courbin 1200a : 
Stark & Ellis 2005), and others will start in the near future (e.g. 
Horton et al, 2004). Therefore in this section we present some pre- 
dictions for the number of LAEs which should be found in near-IR 
searches, at redshifts 7 < z < 20. B arton et al. I 120041) have pre- 
viously made predictions for the number of LAEs at z ~ 8 based 
on a numerical simulation, but assumed a 100% escape fraction for 
Lya photons (i.e. / csc = 1). iThommes & Meisenheimerl 120051) 
have also made predictions for z > 7, but for a phenomenological 
model not based on CDM. 

The upper panel of Fig|8| shows what our standard model, 
with reionization redshift z rc ion = 10 and Lya escape fraction 
/csc = 0.02, predicts for the evolution of the luminosity function of 
LAEs at 6 ^ z ^ 20. We see that the luminosity function declines 
significantly at the bright end from z ~ 6 to z ~ 10, and then de- 
clines with redshift very rapidly at all luminosities at z > 10. This 
decline is driven by the reduction in star formation with increasing 
redshift, which results from the build-up of cosmic structure over 
time. 

The middle and lower panels of Fig[S| show the predicted 
number counts per unit solid angle per unit redshift as a cumu- 
lative function of the Lya flux. These predictions do not include 
the attenuation of the Lya flux by neutral gas in the interven- 
ing IGM, as we discuss below. We show predictions for redshifts 
z = 8, 9, 10, 12 and 16, chosen such that the Lya line falls within 
either the J, H or K atmospheric window. (The J, H and K at- 
mospheric windows cover the wavelength ranges 1.08-1.35, 1.51- 
1.80 and 1.97-2.38 /im respectively, corresponding to Lya redshift 
ranges z = 7.9-10.1, 11.4- 13.8 and 15.2- 18.6.) Ground-based 
searches in the near-IR are likely to concentrate on these atmo- 
spheric windows, because the atmospheric opacity at other near-IR 
wavelengths is extremely high. We have already shown predicted 
angular sizes for these galaxies in Fig[6jb). 

The middle panel of Fig|8| shows how the predicted number 
counts depend on the assumed Lya escape fraction / csc . We show 
results for our standard value of the reionization redshift z t ei<m = 
10, but for two values of / csc , 0.02 (our standard value) and 0.2. 
We recall that the value of / esc was originally chosen in order to 
match the observed counts at z ~ 3, and turns out to provide a good 
match to observations over the whole range 3 < z < 6.6. At higher 
redshifts, no empirical calibration of / CS c is available. Since we do 
not have a detailed physical model for / eac , we cannot exclude the 
possibility that the value at very high redshifts is different from the 
value at lower redshifts. As might be expected, the number counts 
at a given flux are quite sensitive to the value of / esc . Predicted 
counts for other values of / esc than those shown in Fig[S|can easily 
be obtained by scaling the curves in the horizontal direction with 
/e S c, since the flux from each galaxy is proportional to / eB c- 

The lower panel of Fig|8| shows how the predicted number 
counts depend on the assumed reionization redshift z rc ion • As was 
discussed in Section |2| observations of Gunn-Peterson absorption 
troughs in QSO spectra suggest that the IGM became fully reion- 
ized at z ~ 6 l Becker et al. 2001 ), while the WMAP measurement 
of the polarization of the microwave background implies that the 
IGM has been mostly reionized since z ~ 20 I Kogut et al. 120031) . 
We therefore show predictions for z rc ion = 6.5 and 20, in addition 
to our standard value z rc ion = 10. In all three cases we assume 



/csc = 0.02. In our model, reionization is assumed to affect galaxy 
formation in the following way: the IGM is assumed to be instanta- 
neously reionized and reheated at z — z rc i on , and at z < z rc i on , the 
thermal pressure of the IGM is assumed to prevent gas collapsing 
in all dark halos with circular velocities V c < 60km s" 1 . This sim- 
ple behaviour is an approximation t o wha t was found in more de- 
tailed calculations by Bens on et al. 1120021) . The dependence of the 
number counts on z r cion shown in Fig[S|results entirely from this 
effect of reionization on galaxy formation, since we ignore the IGM 
opacity here. For z > z rc i on , all models look identical to the case 
in which the IGM never reionized, but for z < z Ic i on , the number 
counts are suppressed relative to the no-reionization case. We see 
that the predicted counts for the redshift ranges and fluxes shown 
in Fig[8]differ only slightly for z r cion = 6.5 and z r cion — 10, but 
for z r cion = 20, the predicted counts are much lower, except for 
the bright counts at z ~ 8 — 10. 

However, as noted earlier, there is an important caveat to our 
results: our model includes the attenuation of the stellar continuum 
light from galaxies due to Lya scattering and Lyc absorption by 
atomic hydrogen in the intervening IGM, but we do not include 
any attenuation of the flux in the Lya line due to Lya scattering 
in the IGM. At z > z ro ion, when the IGM is completely neu- 
tral, this attenuation of the Lya line flux could potentially be very 
large, which would greatly decrease the number counts at a given 
flux. IGM attenuation will therefore produce a trend in the number 
counts with 8 r eion in the opposite sense to the feedback of reioniza- 
tion on galaxy formation: at z < Zrcion, the feedback effect tends 
to suppress the counts, but the IGM attenuation will be removed, 
which will increase the counts relative to a model in which the IGM 
is still neutral at that redshift. 

The amount of attenuation of the Lya line by the IGM before 
reionization is theoretically very uncertain. iMiralda-Escudd 1 1998) 
showed that for a galaxy at high redshift embedded in a neutral 
IGM moving with the Hubble flow, emitting a Lya line centred at 
its rest-frame wavelength in the frame of the galaxy, scattering by 
atomic hydrogen in the IGM would suppress the blue wing of the 
Lya line completely (reducing the line flux by a factor 2), and also 
partly suppress the red wing due to the Lya damping win gs (re- 
ducing the line flux even more). However. iMadau & Rees 1 1 2000 ) 
and Ha imanl <2002l) showed that this attenuation of the line flux 
could be greatly reduced due to the galaxy photo-ionizing the IGM 
around it. A more detailed theoretical analysis of the attenuation 
has been made by Santos (2004), who includes the following ef- 
fects: (i) the intrinsic width of the Lya line emitted by the galaxy, 
and the fact it may be redshifted in the galaxy rest-frame due to 
scattering in a galactic wind; (ii) the non-uniform density profile of 
the IGM around the galaxy and the departure of the velocity field 
from the Hubble flow, due to cosmological infall onto the galaxy; 
(iii) collisional ionization of the gas within the galaxy halo, photo- 
ionization of the surrounding IGM by the stellar population, and 
clearing of bubbles in the IGM by galactic winds. ISantos I finds 
that a very wide range of attenuation factors is possible in plausi- 
ble models, but that if the Lya emission is redshifted in the rest- 
frame of the galaxy (as is observed to be the case in Lyman-break 
galaxies at z ~ 3), the amount of attenuation is greatly reduced. 
The effects on the attenuation of Lya of clustering of galaxies and 
clump i ng of the IGM have bee n in vestigated bv | Fu rlanettq et al. I 
l2004l)lGnedin & Pradal 120041) . and lWvithe & Loeb I 120051) : thev 
find that these effects can also significantly reduce the amount of 
attenuation. In summary, for LAEs at z > z rc ion, the Lya flux is 
likely to be significantly attenuated by the IGM, but the amount 
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of attenuation is currently uncertain. We plan to investigate this in 
more detail in a future paper. 

A n unsuccessful search for LAEs at z = 8.8 has been carried 
out by IWillis & Courbinl {2005), who place an upper limit n < 
3 x 10~ 3 /i 3 Mpc~ 3 on sources with L Lyct > 10 43 /i~ 2 ergs -1 . 
This limit on the number density is more than 4 orders of mag- 
nitude high er than the prediction for our standard model shown in 
Fiel8tal fwiilis et al. \Em3) and lStark & Ellis N2005I) are carrying 
out surveys for z ~ 9 LAEs which will probe to lower luminosities 
by using gravitational lensing by foreg round clusters. In the near 
future, the DAzLE instrument faorton et al. 120041) on the VLT will 
begin searches for LAEs at z > 7. It is planned to search for Lya 
at z ~ 7.7 in the first phase, and z ~ 8.7 in the second phase, in 
each case over a redshift window Az m 0.01; ultimately it may be 
possible to reach z ~ 14 with the instrument. DAzLE will cover an 
area of AO, = 47 arcmin 2 in a single exposure, and is projected 
to reach a 5<j flux limit of / = 2 x 10~ 18 erg cm _2 s _1 in a 10 
hour integration. We can use our model to predict how many ob- 
jects DAzLE should see in a single 10 hour exposure in each of 
the redshift ranges. For our standard model with z rc i on = 10 and 
/esc = 0.02, we predict the number of sources per unit redshift 
per unit solid angle above flux / = 2 x 10 _18 erg cm _2 s _1 to be 
d 2 N(> f)/dfldz = (0.58, 0.13) arcmin -2 atz = (7.7,8.7) re- 
spectively, which for the specified Az and AQ corresponds to an 
average of 0.3 and 0.06 objects per field at z — 7.7 and z = 8.7 
respectively. Therefore we expect that DAzLE will need to observe 
many separate fields to find a significant number of high-z Lya 
emitters. 



7 CONCLUSIONS 

In this paper, we have used a detailed semi-analytical model of 
galaxy formation based on the ACDM cosmology to predict the 
properties of star-forming Lya-emitting galaxies over the redshift 
range ^ z ^ 20. All except one of the parameters of the model 
were chosen without reference to the observed proper ties of Lya 
emitters, having instead b een chosen in previous work ( Col e et al. I 
l2000tlBaugh et al. 12005 ) to match properties such as the UV, op- 
tical and IR luminosities, sizes, morphological types, gas fractions 
and metallicit ies of galaxies at low and high redshift. As shown in 
iBaugh et al. I l2005t) . our current model, which incorporates a top- 
heavy IMF for stars formed in bursts triggered by galaxy merg- 
ers, provides a good match to the optical and far-IR luminosity 
functions in the local universe, the far-UV luminosity function of 
Lyman-break galaxies at z ~ 3, and the number counts and red- 
shifts of sub-mm galaxies. Our assumption of a top-heavy IMF in 
bursts receives support from studies of the metallicities of ellipti- 
cal galaxies and intracluster gas (Nagashima et al. 2005a 0). The 
one free parameter we had in the comparison of our model with 
observational data on Lya emitters (LAEs) was the fraction / Ba c of 
Lya photons which escape from a galaxy. For simplicity, we have 
assumed that / csc is a constant, irresp ective of ot her galaxy prop- 
erties. In our previous paper on LAEs (Le Delli ou et al. I2005I) . we 
found that / csc = 0.02 reproduced the abundance of faint LAEs at 
z ~ 3, and we used the same va lue in the present paper. 

In iLe Delliou et al. 1 120051) . we presented model predictions 
for the number counts of LAEs as a function of Lya flux for the 
redshift range 2 ^ z ^ 6, but made only a very limited comparison 
with observational data, comparing only with the total counts at the 
limiting fluxes for different surveys. In this paper, we have made 
a much more detailed comparison of the model with observations, 



comparing predicted and observed Lya luminosity functions over 
the range 3 ^ z ^ 7. The most important result of this paper is 
that, with our very simple assumption of a constant Lya escape 
fraction, the model reproduces the observed luminosity functions, 
both in shape and in the evolution with redshift. 

We have also compared the predictions of our model with 
other observed properties of LAEs. We have made a comparison 
of predicted and observed Lya equivalent widths (EWs). We find 
that the typical predicted EWs are similar to those found in ob- 
servational surveys, and that the predicted distribution of EWs at 
a given Lya flux is very broad once we include the effect of dust 
extinction, with a peak at and a tail to large values. If we se- 
lect galaxies in the model according to their continuum magnitudes 
rather than their Lya fluxes, then we predict a distribution of EWs 
which is similar to what has been found observationally for Lyman- 
break galaxies at z ~ 3, when we restrict the comparison to galax- 
ies where Lya is seen in emission. We have compared predicted 
and observed broad-band magnitudes (corresponding to rest-frame 
UV luminosities) for galaxies selected by their Lya fluxes, and find 
mostly good agreement. We have also compared the predicted sizes 
of LAEs with the limited existing observational data, and find rea- 
sonable consistency for the stellar half-light radii. 

We have also used our model to try to better understand the 
nature of the objects selected in Lya emission-line surveys, and 
how they relate to other classes of high-redshift galaxies. We have 
made predictions for the dark halo and stellar masses, the star for- 
mation rates and the gas metallicities for LAEs at different red- 
shifts, properties which are physically fundamental, but for which 
we do not yet have direct observational measurements. The pre- 
dicted halo masses imply values of the clustering bias which seem 
quite consistent with existing measurements of the large-scale clus- 
tering of LAEs. Better observational characterization of the cluster- 
ing of LAEs at different redshifts would provide a very important 
test of our model. 

Finally, we have presented predictions for how many Lya 
emitters should be seen at z > 7, a redshift range for which no 
LAEs have yet been found, but which is now opening up for obser- 
vational study, thanks to advances in near-IR instrumentation. De- 
tection of LAEs in this redshift range would be very exciting both 
for probing the early stages of galaxy formation and the epoch of 
reionization. A problem for making predictions for LAEs at z > 7 
is that the redshift at which the IGM reionized is uncertain, being 
observationally constrained to be in the range 6.5 < z la - lon < 20. 
This affects predictions for galaxies seen at z > z Ic i on , since the 
Lya flux is expected to be significantly attenuated by propagation 
through a neutral IGM, but by an uncertain amount which depends 
on many factors. We have made detailed predictions for how many 
objects could be seen using the DAzLE instrument, which begins 
operation soon, at z ~ 7.7 and z ~ 8.7. We find that, even if the 
attenuation by the IGM is modest at these redshifts, then finding 
LAEs with DAzLE will require observation of a large number of 
fields. 

The most important theoretical limitations of our present work 
are that it does not incorporate a detailed physical model for the 
escape of Lya photons from galaxies, and that we do not include 
a treatment of the attenuation of Lya fluxes by the IGM prior to 
reionization. We plan to address these issues in future papers. 
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